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1 IR-2 (information retrieval): web information retrieval: A practical web-based 

approach to generating topic hierarchy for text segments 
^ Shui-Lung Chuang, Lee-Feng Chien 

November 2004 Proceedings of the thirteenth ACM international conference on 

Information and knowledge management CIKM '04 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citings , index 
terms 



Full text available: ^pdf(351 .23 KB) 



It is crucial in many information systems to organize short text segments, such as 
keywords in documents and queries from users, into a well-formed topic hierarchy. In this 
paper, we address the problem of generating topic hierarchies for diverse text segments 
with a general and practical approach that uses the Web as an additional knowledge 
source. Unlike long documents, short text segments typically do not contain enough 
information to extract reliable features. This work investigates the p ... 

Keywords: clustering, partitioning, search-result snippet, text segment, topic hierarchy 
generation, web data mining 



2 Web: Building a web thesaurus from web link structure 

Zheng Chen, Shengping Liu, Liu Wenyin, Geguang Pu, Wei-Ying Ma 
v' July 2003 Proceedings of the 26th annual international ACM SIGIR conference on 
Research and development in informaion retrieval 

Publisher: ACM Press 

Full text available: fQ pdf(292.05 KB) Additional Information: full citation , abstract , references , citings , index 

'' terms 

Thesaurus has been widely used in many applications, including information retrieval, 
natural language processing, and question answering. In this paper, we propose a novel 
approach to automatically constructing a domain-specific thesaurus from the Web using 
link structure information. The proposed approach is able to identify new terms and 
reflect the latest relationship between terms as the Web evolves. First, a set of high 
quality and representative websites of a specific domain is selected. ... 

Keywords: content structure, link analysis, query expansion, thesaurus 
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Learning search engine specific query transformations for question answering 
Eugene Agichtein, Steve Lawrence, Luis Gravano 

April 2001 Proceedings of the 10th international conference on World Wide Web 
Publisher: ACM Press 

Full text available: *^ pdf(205.68 KB) Additional Information: full citation , references , citings , index terms 



Keywords: Information retrieval, query expansion, question answering, web search 



4 Industrial and government applications track posters: Identifying "best bet" web 
^ search results by mining past user behavior 
^ Eugene Agichtein, Zijian Zheng 

August 2006 Proceedings of the 12th ACM SZGKDD international conference on 

Knowledge discovery and data mining KDD '06 
Publisher: ACM Press 

Full text available: ^pdf(829.11 KB) Additional Information: full citation , abstract , references , index terms 

The top web search result is crucial for user satisfaction with the web search experience. 
We argue that the importance of the relevance at the top position necessitates special 
handling of the top web search result for some queries. We propose an effective approach 
of leveraging millions of past user interactions with a web search engine to automatically 
detect "best bet" top results preferred by majority of users. Interestingly, this problem 
can be more effectively addressed with classificatio ... 

Keywords: user behavior mining, web search ranking, web usage mining 



5 Corrigenda: a hierarchy-aware approach to faceted classification of object-oriented Q 
^ components 

^ E. Damiani, M. G. Fugini, C. Bellettini 

October 1999 ACM Transactions on Software Engineering and Methodology (TOSEM), 

Volume 8 Issue 4 
Publisher: ACM Press 

Full text available: fBpdff310.50 KB) Additional Information: full citation , abstract, references , citings, index 
^ terms 

This article presents a hierarchy-aware classification schema for object-oriented code, 
where software components are classified according to their behavioral characteristics, 
such as provided services, employed algorithms, and needed data. In the case of reusable 
application frameworks, these characteristics are constructed from their model, i.e., from 
the description of the abstract classes specifying both the framework structure and 
purpose. In conventio ... 

6 A hierarchy-aware approach to faceted classification of objected-oriented Q 
components 

E. Damiani, M. G. Fugini, C. Bellettini 

July 1999 ACM Transactions on Software Engineering and Methodology (TOSEM), 

Volume 8 Issue 3 
Publisher: ACM Press 

Full text available: 1jg) D df(310.25 KB) Additional Information: full citation , abstract, references , citings, index 

terms , review 

This article presents a hierarchy-aware classification schema for obje ct-oriented code, 
where software components are classified according to their behavioral characteristics, 
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such as provided services, employed algorithms, and needed data. In the case of reusable 
application frameworks, these characteristics are constructured from their model, i.e., 
from the description of the abstract classes specifying both the framework structure and 
purpose. In conven ... 

Keywords: code analysis, component repositories, component retrieval, software reuse, 
user feedback 



7 Relevance feedback: Using web-graph distance for relevance feedback in web 
^ search 

^ Sergei Vassilvitskii, Eric Brill 

August 2006 Proceedings of the 29th annual international ACM SIGIR conference on 
Research and development in information retrieval SIGIR '06 

Publisher: ACM Press 

Additional Information: full citation , abstract , references , index terms 

We study the effect of user supplied relevance feedback in improving web search results. 
Rather than using query refinement or document similarity measures to rerank results, 
we show that the web-graph distance between two documents is a robust measure of 
their relative relevancy. We demonstrate how the use of this metric can improve the 
rankings of result URLs, even when the user only rates one document in the dataset. Our 
research suggests that such interactive systems can significantly improv ... 

Keywords: link analysis, relevance feedback, web search 
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The use of dynamic contexts to improve casual internet searching 
Gondy Leroy, Ann M. Lally, Hsinchun Chen 

July 2003 ACM Transactions on Information Systems (TOIS), Volume 21 issue 3 
Publisher: ACM Press 

Full text available: ^ pdf(231 .61 KB) Additional Information: full citation , abstract , references , index terms 

Research has shown that most users' online information searches are suboptimal. Query 
optimization based ,on a relevance feedback or genetic algorithm using dynamic query 
contexts can help casual users search the Internet. These algorithms can draw on implicit 
user feedback based on the surrounding links and text in a search engine result set to 
expand user queries with a variable number of keywords in two manners. Positive 
expansion adds terms to a user's keywords with a Boolean "and," negative ... 

Keywords: Information retrieval, Internet, automatic query expansion, genetic 
algorithm, implicit user feedback, personalization, relevance feedback 



An information-theoretic approach to automatic query expansion 

Claudio Carpineto, Renato de Mori, Giovanni Romano, Brigitte Bigi 

January 2001 ACM Transactions on Information Systems (TOIS), Volume 19 issue l 

Publisher: ACM Press 

Full text available* f Sjl pdf(414 88 KB) Addjtional Information: full citation , abstract , references , citings , index 
' : terms , review 

Techniques for automatic query expansion from top retrieved documents have shown 
promise for improving retrieval effectiveness on large collections; however, they often 
rely on an empirical ground, and there is a shortage of cross-system comparisons. Using 
ideas from Information Theory, we present a computationally simple and theoretically 
justified method for assigning scores to candidate expansion terms. Such scores are used 
to select and weight expansion terms within Rocchio's framework ... 
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Keywords: automatic query expansion, information retrieval, information theory, 
pseudorelevance feedback 



10 lR-6 (information retrieval): digital libraries: SERF: integrating human 
recommendations with search 

Seikyung Jung, Kevin Harris, Janet Webster, Jonathan L. Herlocker 
November 2004 Proceedings of the thirteenth ACM international conference on 

Information and knowledge management CIKM '04 
Publisher: ACM Press 

Full text available: pdf(443.46 KB) Additional Information: full citation , abstract , references , index terms 

Today's university library has many digitally accessible resources, both indexes to content 
and considerable original content. Using off-the-shelf search technology provides a single 
point of access into library resources, but we have found that such full-text indexing 
technology is not entirely satisfactory for library searching. 

In response to this, we report initial usage results from a prototype of an entirely new 
type of search engine - The System for Electronic Recommendation Fi ... 

Keywords: collaborative filtering, digital libraries, information retrieval, user studies, web 
search 



11 The paraphrase search assistant: terminological feedback for iterative information 
^ seeking 

^ Peter G. Anick, Suresh Tipirneni 

August 1999 Proceedings of the 22nd annual international ACM SIGIR conference on 
Research and development in information retrieval 

Publisher: ACM Press 

Full text available: | §pdf(1 16.89 KB) Additional Information: full citation , references , citings , index terms 



Keywords: data visualization, query reformulation, terminological feedback 



12 The potential and actual effectiveness of interactive query expansion 
Mark Magennis, Cornelis J. van Rijsbergen 

July 1997 ACM SIGIR Forum , Proceedings of the 20th annual international ACM 

SIGIR conference on Research and development in information retrieval 
SIGIR '97, Volume 31 Issue SI 
Publisher: ACM Press 

Full text available: fj|pdf(1.23 MB) Additional Information: full citation , references , citings , index terms 



13 A semantic-based approach to component retrieval 
^ Vijayan Sugumaran, Veda C. Storey 
^ August 2003 ACM SIGMIS Database, volume 34 issue 3 
Publisher: ACM Press 

Full text available: j jg| pdf(367.67 KB) Additional information: full citation, abstract, references , citings, index 
' ^ : terms 

There continues to be a great deal of pressure to design and develop information systems 
within a short period of time. This urgency has reinvigorated research on software reuse, 
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particularly in component based software development. One of the major problems 
associated with component-based development is the difficulty in searching and retrieving 
reusable components that meet the requirement at hand. In part, this problem exists 
because of the lack of sophisticated query methods and techniques. ... 

Keywords: component based development, domain model, ontology, reuse repository, 
systems development 



14 Evaluating user interfaces to information retrieval systems: a case study on user 
^ support 

^ Giorgio Brajnik, Stefano Mizzaro, Carlo Tasso 

August 1996 Proceedings of the 19th annual international ACM SIGIR conference on 

Research and development in information retrieval 
Publisher: ACM Press 

Full text available: ^ pdf(1.29 MB) Additional Information: full citation , references , citings , index terms 



15 Query result processing: Mining anchor text for query refinement Q 
Reiner Kraft, Jason Zien 

May 2004 Proceedings of the 13th international conference on World Wide Web 

Publisher: ACM Press 

Full text available:^ pdfd 00.27 KB) Additional Information: full citation , abstract , references , index terms 

When searching large hypertext document collections, it is often possible that there are 
too many results available for ambiguous queries. Query refinement is an interactive 
process of query modification that can be used to narrow down the scope of search 
results. We propose a new method for automatically generating refinements or related 
terms to queries by mining anchor text for a large hypertext document collection. We 
show that the usage of anchor text as a basis for query refinement produce ... 

Keywords: anchor text, query refinement, rank, web search 




16 Information retrieval session 1: adhoc retrieval: Using titles and category names from 
editor-driven taxonomies for automatic evaluation 
Steven M. Beitzel, Eric C. Jensen, Abdur Chowdhury, David Grossman 

November 2003 Proceedings of the twelfth international conference on Information 

and knowledge management 
Publisher: ACM Press 

Full text available: fiQ pdf(249.41 KB) Additional Information: full citation , abstract , references , citings , index 
. s terms 

Evaluation of IR systems has always been difficult because of the need for manually 
assessed relevance judgments. The advent of large editor-driven taxonomies on the web 
opens the door to a new evaluation approach. We use the ODP (Open Directory Project) 
taxonomy to find sets of pseudo-relevant documents via one of two assumptions: 1) 
taxonomy entries are relevant to a given query if their editor-entered titles exactly match 
the query, or 2) all entries in a leaf-level taxonomy category are rele ... 

Keywords: automatic evaluation, relevance judgments, web search 

17 NSF workshop on industrial/academic cooperation in database systems 
Mike Carey, Len Seligman 
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March 1999 ACM SIGMOD Record, Volume 28 Issue 1 
Publisher: ACM Press 

Full text available: |S|pdf(1.96 MB) Additional Information: full citation , index terms 




18 Semantic web applications: CS AKTive space: representing computer science in the 
semantic web 

m. c. schraefel, Nigel R. Shadbolt, Nicholas Gibbins, Stephen Harris, Hugh Glaser 
May 2004 Proceedings of the 13th international conference on World Wide Web 
Publisher: ACM Press 

Full text available: ^ pdf(346.31 KB) Additional Information: full citation , abstract , references , index terms 

We present a Semantic Web application that we callCS AKTive Space. The application 
exploits a wide range of semantically heterogeneousand distributed content relating to 
Computer Science research in theUK. This content is gathered on a continuous basis using 
a variety of methods including harvesting and scraping as well as adopting a range 
models for content acquisition. The content currently comprises aroundten million RDF 
triples and we have developed storage, retrieval andmaintenance methods ... 

Keywords: ontologies, semantic web, semantic web challenge groups 



19 Time, media, interaction: A look at some issues during textual linking of 
homogeneous web repositories 

Jose Antonio Camacho-Guerrero, Alessandra Alaniz Macedo, Maria da Graga Campos 
Pimentel 

October 2004 Proceedings of the 2004 ACM symposium on Document engineering 
Publisher: ACM Press 

Full text available: |I| pdf(993.53 KB) Additional Information: full citation , abstract , references , index terms 

Interacting with services that create links automatically via Web users are able to identify 
relationships among documents stored in different repositories. The fact that automatic 
linking services do not use queries performed by a human user has impact in the use of 
information retrieval techniques for the identification of relationships. Information 
retrieval techniques can lead to the identification of relationships that should not have 
been generated (generating non-relevant links) at the ... 

Keywords: homogeneous repositories, information retrieval, linking, semantic structures, 
web 



20 Search improvement via automatic query reformulation 
Susan Gauch, John B. Smith 

July 1991 ACM Transactions on Information Systems (TOIS), volume 9 issue 3 
Publisher: ACM Press 

Full text available: « odf(2.28 MB) Additional Information: full citation , references , citings, index terms , 
^ review 



Keywords: Expert Systems, full-text information retrieval, online search assistance, 
query reformulation, textbases 
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